Mining Unusual Patterns by Multi-Dimensional Analysis of Data Streams

نویسنده

  • Jiawei Han
چکیده

It has been popularly recognized that stream data represents an important form of data, with broad applications. There have been a lot of studies on effective stream data management and query processing, as well as some recent studies on stream data mining. Although this is a promising direction, most existing studies have not paid enough attention to one critical fact: most data streams reside at a rather low level of abstraction and are multi-dimensional in nature, whereas most analysts are interested in finding characteristic features, unusual patterns, and dynamic changes (such as trends and outliers) at relatively high levels of abstraction and in certain multi-dimensional space. To accomplish such tasks, one may need to develop effective mechanisms for on-line, multi-dimensional analysis and mining of stream data. This poses great challenges on system architecture, implementation methodology, algorithm development, and performance tuning. In this paper, we discuss the issues related to effective, on-line, multi-dimensional analysis and mining of unusual events and patterns in data streams, including research challenges, potential architectures, and implementation methodologies.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Mining Frequent Patterns in Uncertain and Relational Data Streams using the Landmark Windows

Todays, in many modern applications, we search for frequent and repeating patterns in the analyzed data sets. In this search, we look for patterns that frequently appear in data set and mark them as frequent patterns to enable users to make decisions based on these discoveries. Most algorithms presented in the context of data stream mining and frequent pattern detection, work either on uncertai...

متن کامل

Proposing an approach to calculate headway intervals to improve bus fleet scheduling using a data mining algorithm

The growth of AVL (Automatic Vehicle Location) systems leads to huge amount of data about different parts of bus fleet (buses, stations, passenger, etc.) which is very useful to improve bus fleet efficiency. In addition, by processing fleet and passengers’ historical data it is possible to detect passenger’s behavioral patterns in different parts of the day and to use it in order to improve fle...

متن کامل

Mining multi-dimensional concept-drifting data streams using Bayesian network classifiers

In recent years, a plethora of approaches have been proposed to deal with the increasingly challenging task of mining concept-drifting data streams. However, most of these approaches can only be applied to uni-dimensional classification problems where each input instance has to be assigned to a single output class variable. The problem of mining multi-dimensional data streams, which includes mu...

متن کامل

A Sliding Window Algorithm for Relational Frequent Patterns Mining from Data Streams

Some challenges in frequent pattern mining from data streams are the drift of data distribution and the computational efficiency. In this work an additional challenge is considered: data streams describe complex objects modeled by multiple database relations. A multi-relational data mining algorithm is proposed to efficiently discover approximate relational frequent patterns over a sliding time...

متن کامل

Multi-Dimensional Analysis of Data Streams Using Stream Cubes

Large volumes of dynamic stream data pose great challenges to its analysis. Besides its dynamic and transient behavior, stream data has another important characteristic: multi-dimensionality. Much of stream data resides at a multidimensional space and at rather low level of abstraction, whereas most analysts are interested in relatively high-level dynamic changes in some combination of dimensio...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002